Super-Wideband Bandwidth Extension for Wideband Audio Codecs Using Switched Spectral Replication and Pitch Synthesis
نویسندگان
چکیده
This paper describes a new bandwidth extension algorithm which is targeted at high quality audio communication over IP networks. The algorithm is part of the Huawei/ETRI candidate for the ITU-T super-wideband (SWB) extensions of Rec. G.729.1 and G.718. In the SWB candidate codec, the 7-14 kHz frequency band of speech and audio signals is represented in terms of temporal and spectral envelopes. This description is encoded and transmitted to the decoder. In addition, the fine structure of the input signal is analyzed and compactly encoded. From this compact information, the decoder can regenerate the 7-14 kHz fine structure either by spectral replication or by pitch synthesis. Then, an adaptive envelope restoration procedure is employed. The algorithm operates in the MDCT domain to allow subsequent refinement coding by vector quantization of spectral coefficients. In the paper, relevant listening test results for the G.729.1SWB candidate codec that have been obtained during the ITU-T standardization process are summarized. Good audio quality could be shown for both speech and music signals.
منابع مشابه
Audio bandwidth extension using ensemble of recurrent neural networks
In audio communication systems, the perceptual audio quality of the reproduced audio signals such as the naturalness of the sound is limited by the available audio bandwidth. In this paper, a wideband to super-wideband audio bandwidth extension method is proposed using an ensemble of recurrent neural networks. The feature space of wideband audio is firstly divided into different regions through...
متن کاملArtificial Bandwidth Extension of Wideband Speech by Pitch-Scaling of Higher Frequencies
In this paper, a simple DFT-domain pitch-scaling technique is used to extend the audio bandwidth of wideband speech (50Hz – 7 kHz) to the super-wideband range (50Hz – 12 kHz). Therefore, the higher frequencies of the wideband signal (6 – 7 kHz) are pitch-scaled with a scaling factor of four and the resulting, scaled signal is inserted into the 8 – 12 kHz band. A subjective listening test has be...
متن کاملBandwidth Extension of Speech Signals: A Comprehensive Review
Telephone systems commonly transmit narrowband (NB) speech with an audio bandwidth limited to the traditional telephone band of 300-3400 Hz. To improve the quality and intelligibility of speech degraded by narrow bandwidth, researchers have tried to standardize the telephonic networks by introducing wideband (50-7000 Hz) speech codecs. Wideband (WB) speech transmission requires the transmission...
متن کاملFrom Narrowband Telephony to Wideband Telephony
The restricted audio quality of today’s telephone networks is mainly due to the narrowband (NB) limitation to the frequency range from about 300 Hz to 3.4 kHz. Meanwhile, codecs for wideband (WB) telephony (50 Hz to 7 kHz) exist with significantly improved speech intelligibility and naturalness. However, the broad introduction of wideband speech coding will require strong efforts of both networ...
متن کاملSubjective voice quality evaluation of artificial bandwidth extension: comparing different audio bandwidths and speech codecs
Artificial bandwidth extension (ABE) methods have been developed to improve the quality and intelligibility of telephone speech. In many previous studies, however, the evaluation of ABE has not fully reflected the use of ABE in mobile communication (e.g., evaluation with clean speech without coding). In this study, the subjective quality of ABE was evaluated with absolute category rating (ACR) ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010